Lip-reading based on a fully automatic statistical model
نویسندگان
چکیده
In this paper, we describe audiovisual automatic speech recognition experiments carried using visual parameters extracted from “natural” images. Unlike many other experiments in the AV ASR field, these visual parameters are obtained without any hand-labeling phase and are naturally noisy, due to the extraction process. We evaluate our models with different strategies among which : use of a shape model combined with or after an appearance model. For audiovisual parameters integration, we use a basic DI architecture with a fixed weight. We use a new evaluation criterion to measure the quality of parameters which proves to be efficient, and aim to use it in the near future, for an adaptive weighting scheme.
منابع مشابه
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملAutomatic Hybrid Approach for Lip POI Localization: Application for Lip-reading System
Automatic Lip-reading system is one of the different assistive technologies for hearing impaired or elderly people. We can imagine, for example, a dependent person ordering a machine with an easy lip movement or by a simple visemes (visual phoneme) pronunciation. The need for an automatic lip-reading system is ever increasing. The lip-reading system is decomposed in three subsystems, first we h...
متن کاملManaged Pressure Drilling Using Integrated Process Control
Control of wellbore pressure during drilling operations has always been important in the oil industry as this can prevent the possibility of well blowout. The present research employs a combination of automatic process control and statistical process control for the first time for the identification, monitoring, and control of both random and special causes in drilling operations. To this end, ...
متن کاملAutomated Gesturing for Embodied Animated Agent: Speech-driven and Text-driven Approaches
We present two methods for automatic facial gesturing of graphically embodied animated agents. In one case, conversational agent is driven by speech in automatic Lip Sync process. By analyzing speech input, lip movements are determined from the speech signal. Another method provides virtual speaker capable of reading plain English text and rendering it in a form of speech accompanied by the app...
متن کاملAutomatic Lip Reading for Daily Indonesian Words Based on Frame Difference and Horizontal-vertical Image Projection
Automatic lip reading is one of research being developed lately. Automatic lip reading has been used for various purposes, such as enhancing speech recognition and aid to speech training for the deaf. There are two approaches in lip feature extraction, namely appearance based and shape based. Appearance based approach is usually better, because it provides visual features that cover not only li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002